Enhanced End-of-Turn Detection for Speech to a Personal Assistant
نویسندگان
چکیده
Speech to personal assistants (e.g., reminders, calendar entries, messaging, voice search) is often uttered under cognitive load, causing nonfinal pausing that can result in premature recognition cut-offs. Prior research suggests that prepausal features can discriminate final from nonfinal pauses, but it does not reveal how speakers would behave if given longer to pause. To this end, we collected and compared two elicitation corpora differing in naturalness and task complexity. The Template Corpus (4409 nonfinal pauses) uses keyword-based prompts; the Freeform Corpus (8061 nonfinal pauses) elicits open-ended speech. While nonfinal pauses are longer and twice as frequent in the Freeform data, prepausal feature modelling is roughly equally effective in both corpora. At a response latency of 100 ms, prepausal features modelled by an SVM reduced cut-off rates from 100% to 20% for both corpora. Results have implications for enhancing turn-taking efficiency and naturalness in personal-assistant technology.
منابع مشابه
Persian Adaptation of Enhanced Milieu Teaching for Iranian Children With Expressive Language Delay
Objectives: This study aimed at adapting and examining the applicability of the Teach-Model-Coach-Review model of the enhanced milieu teaching (EMT) approach for improving Iranian mothers’ language strategies while interacting with their toddlers with expressive language delay. Methods: In a single-subject multiple-baseline across-behavior study, the mothers of 3 toddlers with expressive langu...
متن کامل"Where Are the Christmas Decorations?": A Memory Assistant for Storage Locations
natural language understanding, database systems, semi-structured data, speech recognition At Hewlett-Packard Laboratories we want to know how inexpensive it can be to endow mobile personal assistants with the ability to speak naturally with their users. To this end, we are investigating and demonstrating speech-capable mobile personal assistants that can be realized using common off-the-shelf ...
متن کاملMobile Reading Assistant for Blind People
This paper describes an embedded device dedicated for blind or visually impaired people. The main aim of this system is to build an automatic text reading assistant using existing hardware associated with innovative algorithms. A personal digital assistant (PDA) was chosen because it combines small-size, computational resources and low cost price. Three key technologies are necessary: text dete...
متن کاملTeleradiology on a Personal Digital Assistant
This paper describes the porting of a teleradiology system to a Personal Digital Assistant (PDA). The basis for this formed the CHILI teleradiology and PACS system developed by the Steinbeis Transferzentrum Medizinische Informatik, Heidelberg (STZ) in cooperation with the German Cancer Research Center. The work was done as part of a EU IST project called Multimedia Terminal Mobile (MTM). The au...
متن کاملMulti-band summary correlogram-based pitch detection for noisy speech
A multi-band summary correlogram (MBSC)-based pitch detection algorithm (PDA) is proposed. The PDA performs pitch estimation and voiced/unvoiced (V/UV) detection via novel signal processing schemes that are designed to enhance the MBSC’s peaks at the most likely pitch period. These peak-enhancement schemes include comb-filter channel-weighting to yield each individual subband’s summary correlog...
متن کامل